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Abstract 

We present and study a probabilistic neural automaton in which the 
fraction of simultaneously-updated neurons is a parameter, p e (0, 1) . 
For small p, there is relaxation towards one of the attractors and a 
great sensibility to external stimuli and, for p > Pc, itinerancy among 
attractors. Tuning p in this regime, oscillations may abruptly change 
from regular to chaotic and vice versa, which allows one to control the 
efficiency of the searching process. We argue on the similarity of the 
model behavior with recent observations and on the possible role of 
chaos in neurobiology. 

1 Introduction 

Attractor neural networks (ANN) are a paradigm for the property of associa- 
tive memory ( Hopfield, 1982[|Amit, 1989 ). Nevertheless, concerning practi- 



cal applications, and also when trying to mold the essence of actual systems, 
the utility of ANN is severely limited, mainly by the fact that they can only 
retrieve one memory at the time. In this note we show that such a limita- 
tion may be systematically overcome by simply generalizing familiar model 
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situations. More specifically, we here extend some of our recent work on 



Torres et al., 2007 



ANN with fast pre-synaptic noise ( Cortes et al., 2006 
Marro et al., 2007] ). The result is a novel mathematically-tractable ANN 



whose activity eventually describes heteroclinic paths among the attractors. 
This illustrates, in particular, the possibility of a constructive role of chaos 
during searching processes. 

Our previous related studies essentially considered the same model sys- 
tem but two different ways of updating it, namely, (i) sequential and (ii) 
parallel updating. Interesting enough, the ensuing behavior was qualita- 
tively, even dramatically different. That is, the main observation was, re- 
spectively, (i) a great enhancement of the system sensibility to external 
stimuli as a consequence of rapid synaptic fluctuations which simulate facil- 



itation and/or depression ( [Cortes et al., 2006} [Torres et al., 2007D , and {ii) 



chaotic behavior while the system spontaneously visited all the available 
attractors (Marro et al., 2007). Each of these two regimes of behavior is 
to be associated with a different functionality of an essential dynamic in- 
stability. Such an important dependence on the updating process is rather 
unexpected. For instance, we checked that it does not occur in a recent 
model ( Pantic et al. , 2002 Pantic et al. , 2003 ) which is based on a different 
depression mechanism. This situation motivated us to study in detail the 
changeover between (i) and (ii) as a modification of our previously proposed 
ANN ( [Cortes et al., 2006tparro et al., 2007[ ). That is, we here present neu- 
ral automata in which the number or density p of neurons that are updated 
at each time step is a parameter. The resulting behavior as one modifies p 
is varied and intriguing. It leads us to argue on the possible relevance of our 
observations to interpret neurobiological experiments. 



2 Definition of model 

Let the sets of neuron activities a = {ai} and synaptic weights w = {wij G M} , 
where i,j = 1, . . . ,N, and assume a presynaptic current hi (cr, w) on each 
neuron due to the weighted action of the others. At each time unit, one 
updates the activity of n neurons, 1 ^ n ^ A^. This induces evolution in 
discrete time, t, of the state probability distribution according to 

Pi+i(a) = (1) 

/ 

a 
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where the transition rate is a superposition: 



^ {i\xi=l} {i\x^=0} 



8. 



and we denote x 



Here, (pn {ai cr'^) = {en ^ cr^ 1 + - 1 

{xi = 0, 1} an extra set of indexes which helps one in selecting the desired 
subset of neurons. The above thus describes parallel updating, as in familiar 



cellular automata (Chopard and Droz, 1998), for n = N or, macroscopically. 



p = n/N —> 1, while updating proceeds sequentially, as in kinetic Ising-like 



models ( |Marro and Dickman, 1999 ), for n = 1 or /) — > 0. 

We shall consider explicitly the simplest version of this model which 
happens to be both interesting and mathematically tractable. First, we as- 
sume binary neurons, so that = ±1, which is known to be sufficient in 



order to capture the essentials of cooperative processes (Pantic et al., 2002 



Marro and Dickman, 1999; Abbott and Kepler, 1990). The elementary rate 



(p is an arbitrary function of (3aihi (with (5 an inverse "temperature" or 
stochasticity parameter) which we assume to satisfy detailed balance. This 
property is not fulfilled by the superposition ([2]) for n > 1, however. Conse- 
quently, the resulting steady states are generally out of equilibrium, which is 
more realistic in practice than thermodynamic equilibrium ( [Marro and Dickman, 1999 ) 



On the other hand, we shall only illustrate the case in which the n neurons 

are chosen at random out from the set of so that one has pn (x) = 
-1 

5 Xi — n) in ([2|). For the sake of simplicity, we also need to as- 



N 



sume that the currents are such that hi {a, w) = /i [tt (a) , ^j] , where = 
{^^ = zbl;// = l,..., M} are some given, stored patterns (realizations of the 
set of activities) and vr = {tt'^ (f)}. Here, t:^ (a) = -/V'^^j^fcTj measures 
the overlap between the current state and pattern fi. For N ^ oo and finite 
M, i.e., in the limit a = M/N (which is not the interesting case, but 
may serve first for illustrative purposes) the resulting time equation under 
these conditions is t^i_^i (o") = pN~^ tanh (/i*) + {I — p) tt^ (a) , where 

hj = (3hi [nt (a) , ^j] , for any p. The above result is general and valid for any 
type of patterns. It is to be noticed that the sum over i in this map can be 
replaced by an average over the distribution of patterns p(Cf ). This permits 
a simple derivation of mean-field dynamical equations for the overlaps, at 
least for finite M. Note also that Monte Carlo simulations do not require 
restriction concerning the nature of the stored patterns. 

The above allows for different relations between the currents hi and the 
weights Wij, and between these and other system properties. The simplest 
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realization corresponds to the Hopfield case ( Hopfield, 1982^ which follows 
from the map above for p ^ and currents given by hi {a, w) = Ylj^i '^ij'^i 



with the weights fixed according to the Hebb prescription, namely, m 
7V"^ Y,^ iiiy The symmetry Wij = Wji then assures Pt^oo (o") oc exp (/? J2i ^i^i) 
and, for high enough P, the stored patterns ^ are attractors of dynam- 
ics (Amit, 1989). We checked that, in agreement with some indications 
(Herz and Marcus, 1993), the Hopfield~Hebb network exhibits associative 
memory for any p > 0. However, the situation is more complex, e.g., it 
depends on p, as one goes beyond Hopfield-Hebb, as we show in the next 
section. 

It is well documented that transmission of information and computa- 
tions in the brain are correlated with activity-induced fast fluctuations of 
synapses, i.e., our Wij^s ( Ferster, 1996 Dobrunz and Stevens, 1997 Abbott and Regehr, 2004 ). 
More specifically, it has been observed that there is some efficacy lost af- 
ter heavy work, so that synapses suffer from depression; it is claimed that 
repeated activation decreases the neurotransmitter release which depresses 



the synaptic response (Tsodyks et al., 1998 


Thomson and Deuchars, 1994 


Abbott et al., 1997; Thomson et al., 2002; Cook et al., 2003 


). The conse- 


quences of this have already been analyzed in various contexts 


Pantic et al., 2002 


Cook et al, 2003 Bibitchkov et al., 2002 Cortes et al., 2006 


Marro et al., 2007 


Torres et al., 2007), and a main general conclusion from these studies is that 



depression importantly affects a network performance reducing, in particu- 
lar, the stability of the attractors. Motivated by these facts, we shall adopt 
here the Hopfield currents and the following prescription for the synaptic 
weights: 

Wi, = [l-{l-^)q{7T)]N-'Yl (3) 



where q (vr) = vr^ (cr)^ . Note here that, in addition of static quenched 

disorder as in the standard Hopfield model, the weights ([3|) include a time 
dependence through the overlap vector vr which is a measure of the network 
firing activity. These weights, which reduce to the Hebb prescription for 
$ = 1, amount to assume short-term fluctuations which change synapses by 
a factor $ on the average with a probability g(7r). Therefore, any positive 
<I> < 1 simulates synaptic depression if q (vr) is large. This is in agreement 
with the fact that, the greater vr is, more activity will in the average arrive 
to a particular postsynaptic neuron i in the network and, therefore, this 
neuron will be more depressed. Although the magnitude q (vr) involves a 
sum over all stored patterns, this will only affect neurons that are active 
in a particular pattern for not too high correlated patterns. More details 
concerning these assertions are in (Cortes et al., 2006 Marro et al., 2007). 
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Our setting here is rather close to the one in previous treatments of 
depressing synapses in a cooperative environment. As a matter of fact, one 



may show after some simple algebra that the model in (Pantic et al., 2002 



Torres et al., 2002 Pantic et al., 2003 ) corresponds to certain choices of ^ 



and q (it) in ^ concerning steady states. For instance, a possible choice 
for M = 1 and p = lis$ = l- 7/70 and ^(vr) = ^l°^7-l"^)+47+4 where 
7 is the depression parameter defined in ( Torres et al., 2002| ) and 70 is the 
value for that parameter at which $ = 0. This type of nonlinearity in q (vr) , 
however, induces less susceptibility than the choice we are using here (see 
next section). 

For the sake of completeness, we shall be concerned in this paper with 
both positive and negative values of A result is that the behavior we are 
looking for ensues in any of these cases (but only for certain values of 



3 Some main results 

In the limite — > 00 the (nonequilibrium) stationary state follows from 
the map for M = 1 as tToo = F [tToo] P, ^) ; and local stability requires that 
\dF/d7r\ < 1; F{tt;p,<^>) = ptanh{/?7r [l - (1 - vr^] } + (1 - p) vr. The 
fixed point is therefore independent of p, but stability demands that p < pc 
with 



Pc = 2{ 3(37rl 
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+ 1 (4) 



(a condition that cannot be fulfilled in the Hopfield, ^ = 1 case). As Fig. [T] 
shows, p = Pc marks the period-doubling route to chaos in the saddle-point 
map. This behavior is confirmed numerically for M ^ 1 stored arbitrary 
patterns, as shown numerically below. 

Fig. [5] shows some typical stationary Monte Carlo runs, i.e., from bot- 
tom to top: (a) convergence towards one attractor — in fact, one of the 
antipatterns, namely, the negative of one of the given patterns — for small 
p; (b) fully irregular behavior with positive Lyapunov exponent for p > p^, 
(c) regular oscillation between one attractor and its negative for p > p^, (d) 
onset of chaos as p is further increased; and (e) rapid and ordered periodic 
oscillations between one pattern and its antipattern when, finally, all the 
neurons are active. The cases (b) and {d) are examples of instability-induced 
switching phenomena, in which the system activity chaotically visits differ- 
ent attractors by describing heteroclinic paths and remaining different time 
intervals in the neighborhood of each attractor. This kind of behavior was 



previously observed for /) — > 1 at certain values of ^ (Cortes et al., 2006). 
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The interesting new facts are that this requires a minimum of synchronized 
neurons, that this minimum — as well as many other details — depend on 

and that, as we show in the caption of figure [H varying p above pc (^) 
seems to induce further intriguing qualitative changes. 

It is also to be remarked that chaotic switching or itinerancy requires that 



the system is in a specially susceptible state first described in (Cortes et al., 2006 



Torres et al., 2007). This is accomplished in the present case by means of 



the activity-dependent fast noise modelled in ([3]). One should expect that 
variations of this assumption on the weights may result in an equivalent 
susceptible state. As a matter of fact, we found that changing the sign 
of $ does not affect our main observations. However, the case ^> = 1, in 
which the weights are fixed, does not exhibit interesting behavior, and p 
turns then into an irrelevant parameter. On the other hand, the model in 
qPantic et al., 2002[ [Torres et al., 2002t [Pantic et al., 2003D does not seem 
to involve sufficient susceptibility for the purpose (see figure [3]), in spite of 
the fact that it includes an activity-dependent depression mechanism. The 
explanation is the following. Assuming that the dynamics can be writen as 
T^t+i = G{'Kt), the gain function G(7r) in the model in (Torres et al., 2002) is 
a nonlinear one which behaves monotonically for all values of the depression 
parameter. In our case, however, a non-monotonic type of gain function 
occurs for some values of $ and p (see comparison in figure U]) . This has 
been reported to be important to originate a chaotic dynamics among the 
attr actors ( Dominguez and Theumann, 1997 Caroppo et al., 1999 ). 

Monitoring activity trajectories as one varies p in the case of several 
stored patterns provides the following qualitative picture for arbitrary pat- 
terns. As far as p < pc, the activity remains wandering around one of the 
patterns. The pattern selected depends on the initial condition, and the tra- 
jectory visits a neighborhood of it whose volume increases slightly with p. 
The trajectory seems to tend to densely fill this volume with time. Increas- 
ing p, however, the system may escape from the initially chosen pattern and, 
eventually, will tend to visit all the patterns. In addition, one observes that 
the trajectory is rather structured. That is, there are many jumps between 
the more correlated patterns but only very few to the less correlated ones 
if the system is close to the edge of chaos, and the system attention to all 
the patterns tends to be balanced as p is increased within a chaotic window. 
Increasing p further, the network surpasses equiprobability of patterns and, 
eventually, abandons the chaotic regime to fall into a limit cycle, where pe- 
riodically oscillates between a pattern and its antipattern. This confirms 
and details the behavior shown in figure [TJ 

This behavior, which is clearly observed in Monte Carlo simulations, can 
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also be obtained under a mean field theory. Assume, for instance, random 
patterns with ) = i^<5(^f - 1) + ^-5(^f + 1) where i^") = a, even for 
< |a| <C 1. In the simplest case of two patterns this mean field dynamics 
is determined by 

^tVi = ^^tanh[i?(^t)(7ri+7r2)] +/, i^tanh[i?(7rt)(7ri-7r2)] 
^t+i = P i^tanh[SK)(7ri+7r2)] - p tanh[5(7rt)(7ri -vr^)] 

(5) 

where B^tt) = — (1 — $)g(7r)]. It may be noticed that only in the non- 
interesting case of orthogonal patterns, namely a = 0, the mean field dy- 
namics dS]) gives chaotic switching between a particular pattern and its an- 
tipattern but not between different patterns. Otherwise, the situation is of 
chaotic switching among the stored patterns. 



4 Discussion 

This paper deals with ANN in which the density p of neurons that are 
updated at each time step is a parameter, so that the limit p ^ (1) corre- 
sponds to sequential (parallel) updating. Our main motivation is that previ- 
ous studies of ANN in these limits revealed qualitatively different behavior, 
and that analysis in which the number of updated neurons is systematically 



varied are rare in the literature, e.g., (Herz and Marcus, 1993). It is worth to 



remark also that there are several arguments which suggest studying changes 
with p. One is simply the suspicion, born outside biology, that a network 
could perhaps like to maintain inert some of the nodes during operation, 
and not necessarily for economy but in order to gain efficiency. As a matter 
of fact, as one may get convinced by oneself by looking at our expressions 
for the currents hi, hushing some of the nodes may be equivalent to mod- 
ifying the wiring topology, and this is recognized as a method to enhance 



a network efficiency (Torres et al., 2004). More specifically within biology 



one may notice that assuming cells that are stimulated only in the presence 
of a neuromodulator such as dopamine, p could stand for the fraction of 
neurons modulated each cycle. There is no input on the other 1 — p, so that 
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information from the previous state is maintained, which was argued to be 
a basis for working memories ( Egorov et al. , 2002 ; LeBeau et al. , 2005 ) . On 



the other hand, varying p may also be relevant to simulate various situa- 
tions of persistent activity ( Wagenaar et al., 2006 ), the observed variability 
of the neurons threshold ( Azouz and Gray, 2000 ), and the possible existence 
of silent neurons (Olshausen and Field, 2004; Shoham et al., 2006), for in- 
stance. 

The fact is that varying p in our model turns out to be very intriguing. 
However, p is relevant only if the network is susceptible. Such a condition 
occurs in our case as a consequence of activity-dependent fast synaptic noise 
as modelled in ([3]). The parameter p is irrelevant in other cases as, in 
particular, for the model in (Pantic et al., 2002; Pantic et al., 2003) which 
is based on the depression mechanism introduced in ( Tsodyks et al., 1998 ), 
and also when the synaptic weights are fixed, even heterogeneously as in 
a Hopfield-Hebb network. On the contrary, the model here exhibits kind 
of dynamic association, namely, the activity either goes to one attractor or 
else, for large enough visits possible attractors. The visits may abruptly 
become chaotic. Besides synchronization of a minimum of neurons, this 
requires careful tuning of p. As a matter of fact, as shown by equation (j3|) 
and figure [H a complex situation makes it difficult to predict the result for 
slight changes of p. 

Another interesting feature of our model is illustrated in figure O This 
shows time series of the mean firing rate, ra = ^ (1 -|- fjj) , in a case 
study with six patterns exposed to two different stimuli of the same intensity 
and duration (between 3000 and 4000 n Monte Carlo trials). Each pattern 
is a string of N bits. Three patterns are randomly generated with 40, 50 
and 60% of the bits set to 1, and the other three with the Is at the first 
70, 50 and 25% positions, respectively; the rest of the bits are set to —1. 
The bottom graph shows the baseline activity without stimulus (BS) and 
the activity level under stimulus p = 1 (SAl) and p = 2 (SA2), i.e., two of 
the patterns. The behavior which exhibits the system in this case (which we 
found for other parameter values as well) is amazingly alike to observations 
in a comparable (but true, not computer) experimental setting concerning 
the odor response of the projection neurons in the locust antennal lobe 



(Rabinovich et al., 2001 Mazor and Laurent, 2005). 



Interesting enough, the switching which shows our model due to stim- 
ulus destabilization in the simulation of figure [5] occurs for p < pc- In fact, 
a similar phenomenon was observed also for p ^ (Cortes et al., 2006). 
This shows that, at least in this case, an efficient adaptation to a changing 
environment does not require chaos. However, the chaotic itinerancy we de- 
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scribed above allows for a more efficient search of the attractors space in a 
way that was believed to hold in relevant systems under a critical condition 
dChialvo, 2006 ). Our model thus illustrates a mechanism that makes chaos 



extremely beneficial. This confirms expectations (Korn and Faure, 2003 



Glass, 2002 Ashwin and Timme, 2005) that the instability inherent to chaos 



facilitates moving to any pattern at any time. The present model system 
illustrates a specific mechanism which allows for this. As p increases in a 
chaotic region, it is more likely that the activity will visit all the attractors, 
not only the most correlated ones. The number and diversity of attractors 
it visits then increases with p, and we observed that the time spent in the 
attractor also varies with p. The system in this way may perform family 
discrimination and classification by tuning p. We finally remark that our 
model allows for describing a coupling of p to the activity, which may be 
quite a realistic setting in some cases. No doubt it would be interesting to 
study other related model situations. 

We thank I. Erchova, P.L. Garrido and H.J. Kappen for very useful com- 
ments, and financial support from FEDER-MEC project FIS2005-00791, JA 
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Figure Captions 



Figure 1: The Lyapunov exponent (solid curve), showing transitions from 
regular (A < 0) to chaotic (A > 0) as the synchronization parameter p = 
n/N is varied, as obtained analytically from the saddle-point solution for 
$ = 0.005, M = 1 patterns, and (5 = 50. The chaotic windows here were 
precisely confirmed using related Monte Carlo simulations with = 3600 
neurons. The minimum fraction of active neurons needed to start the period- 
doubling route to chaotic behavior, />c, is shown. This picture is strongly 
dependent on there is a rather broad range of ^ values, including negative 
ones, for which the behavior is qualitatively similar. The dashed curve is the 
Hopfield-Hebb case $ = 1. The inset details the interesting region showing 
chaotic behavior. 

Figure 2: The overlap as a function of time (in units of n Monte Carlo 
trials) after t = 1920, for N = 1600, /3 = 20, $ = -0.4, M = 3 uncorrected 
patterns and, from bottom to top, p = 0.08, 0.50, 0.65, 0.92 and 1.00, 
respectively. In this case, pc = 0.085. 

Figure 3: Time variation of the mean firing rate m = (1 -|-7r)/2 in an attrac- 
tor neural network which stores a single pattern with depressing synapses, 
as modeled in ( Tsodyks et al., 1998 Pantic et al., 2002 ), under partial up- 
dating in the oscillatory regime. Panels show, from top to bottom, the 
cases p = 1,0.7,0.3,0.1. This (which corresponds to certain model param- 
eters) reveals that, except for scaling of the typical temporal scale for the 
oscillations, partial updating does not introduce new phenomenology in this 
model, contrary to the case presented in this paper. 

Figure 4: This compares the gain function in the model in this paper, 
for /O = 1 and varying <I> (left panel) and the gain function in the model in 
( Torres et al., 2002) for varying 7 (right panel). In both cases (3 was set to 3. 
Different curves in the left case are for $ = 1 (non-depressed case), 0.6, 0.2 
and (hight depression); the curves in the right case occur when the corre- 
sponding parameter 7 = (non-depressed case), 0.5, 3, 10 (high depression case) 
This shows how the gain function can be non-monotonic for some values of 
the depression parameter # in the model in this paper. This allows for non- 
zero fixed point solutions, namely, the points that intersect the diagonal, 
with negative slopes (whose absolute value is larger than one) which leads 
to a period-doubling route to chaos. 

Figure 5: Itinerancy induced by external stimuli. Mean firing rates as a 
function of time (bottom) and phase-space trajectories (top) trying to recre- 
ate an experimental observation concerning odor responses ( |Mazor and Laurent, 2005 ). 
The graphs show two Monte Carlo simulations of our system with = 1600, 
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/5 = 4, <^ = —0.45, p = 3/64 < pc, and six stored patterns, for different stim- 
uli, corresponding to green and red colors, respectively. The top graph in- 
volves a standard false-neighbor method ( jEckmann and Ruelle, 1985D with 
embedding dimension de = 5, and the time delay is r = 20. 
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Figure 1: Torres et al. 
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Figure 2: Torres et al. 
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Figure 3: Torres et al. 
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Figure 4: Torres et al. 
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Figure 5: Torres et al. 



19 



